Visual Reference Resolution using Attention Memory for Visual Dialog

Authors

  • Paul Hongsuck Seo
  • Andreas Lehrmann
  • Bohyung Han
  • Leonid Sigal
Abstract

Visual dialog is the task of answering a series of inter-dependent questions about an input image, and it often requires resolving visual references among the questions. This problem differs from visual question answering (VQA), which relies on spatial attention (a.k.a. visual grounding) estimated from a single image and question pair. We propose a novel attention mechanism that exploits past visual attentions to resolve the current reference in the visual dialog scenario. The proposed model is equipped with an associative attention memory that stores a sequence of previous (attention, key) pairs. From this memory, the model retrieves the previous attention that is most relevant to the current question, taking recency into account, in order to resolve potentially ambiguous reference(s). The model then merges the retrieved attention with the tentative one to obtain the final attention for the current question; specifically, we use dynamic parameter prediction to combine the two attentions conditioned on the question. Through extensive experiments on a new synthetic visual dialog dataset, we show that our model significantly outperforms the state of the art (by ≈ 16 percentage points) in situations where visual reference resolution plays an important role. Moreover, the proposed model achieves superior performance (≈ 2 percentage points improvement) on the Visual Dialog dataset [1], despite having significantly fewer parameters than the baselines.
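The retrieve-then-merge idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the scoring function, the form of the recency term, or the merging network. The function names (`retrieve_attention`, `merge_attentions`), the linear recency penalty `recency_weight`, and the scalar sigmoid gate `gate_params` are all assumptions made for illustration. In particular, the scalar gate is a simplified stand-in for the dynamic parameter prediction mentioned above.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    """Dot product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))


def retrieve_attention(memory, query, recency_weight=0.5):
    """Soft-retrieve a past attention map from an associative memory.

    memory: list of (attention, key) pairs, oldest first.
    query:  vector derived from the current question (assumed given).
    The score is key-query similarity minus a penalty that grows with
    the age of the entry (an assumed, linear form of "recency").
    """
    n = len(memory)
    scores = [
        dot(key, query) - recency_weight * (n - 1 - i)
        for i, (_, key) in enumerate(memory)
    ]
    weights = softmax(scores)
    size = len(memory[0][0])
    # Weighted sum of the stored attention maps.
    retrieved = [
        sum(w * att[j] for w, (att, _) in zip(weights, memory))
        for j in range(size)
    ]
    return retrieved, weights


def merge_attentions(retrieved, tentative, question, gate_params):
    """Combine retrieved and tentative attention, conditioned on the question.

    Simplification (assumption): a scalar gate predicted from the question
    replaces the dynamic parameter prediction used in the paper.
    """
    gate = 1.0 / (1.0 + math.exp(-dot(gate_params, question)))
    merged = [gate * r + (1.0 - gate) * t for r, t in zip(retrieved, tentative)]
    total = sum(merged)
    return [m / total for m in merged]


# Toy usage: two past attention maps over four image regions.
memory = [
    ([0.7, 0.1, 0.1, 0.1], [1.0, 0.0]),  # older entry
    ([0.1, 0.1, 0.1, 0.7], [0.0, 1.0]),  # more recent entry
]
query = [1.0, 0.0]  # hypothetical encoding of the current question
retrieved, weights = retrieve_attention(memory, query)
final = merge_attentions(retrieved, [0.25, 0.25, 0.25, 0.25], query, [2.0, -2.0])
```

In this toy setup the query matches the key of the older entry, so that entry receives the larger retrieval weight even after the recency penalty. Raising `recency_weight` shifts the retrieval toward the most recent attention map.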


Similar resources

The Effect of Neuropsychological Rehabilitation on Visual Memory Performance and Social Adjustment in Children with Attention Deficit Hyperactivity Disorder

Background and Objectives: One of the most common disorders in most societies is attention deficit hyperactivity disorder (ADHD). The purpose of this study was to investigate the effect of neuropsychological rehabilitation on visual memory performance and social adjustment in children with attention deficit hyperactivity disorder.   Methods: This research is a quasi-experimental study with pre...


Utilizing Visual Attention for Cross-Modal Coreference Interpretation

In this paper, we describe an exploratory study to develop a model of visual attention that could aid automatic interpretation of exophors in situated dialog. The model is intended to support the reference resolution needs of embodied conversational agents, such as graphical avatars and robotic collaborators. The model tracks the attentional state of one dialog participant as it is represented ...


Effectiveness of working memory intervention in behavior inhibition and visual working memory of children with Attention Deficit/ impulsive subtype Disorder (ADHD-I)

The aim of the current research study was to determine the effectiveness of working memory intervention on behavioral inhibition and visual working memory in children with symptoms of attention-deficit/impulsivity disorder in Selseleh city. The method was a quasi-experimental pre-test-post-test design with a control group. The statistical population consisted of all boy students of 8-12 years o...


Interaction of sensory experience and age in spatial memory performances

During a critical period of postnatal age sensory experience has a profound effect on maturation of visual cortical wiring. Electrophysiological evidence is indicating a substantial effect of visual deprivation on the visual cortical response properties. In this study we evaluated effect of light deprivation during a limited time of postnatal age on two aspects of spatial (working and reference...



Publication date: 2017